659 research outputs found
Using Machine Learning to Identify Patterns in Learner-Submitted Code for the Purpose of Assessment
Programming has become an important skill in today’s world and is taught widely both in traditional and online settings. Instructors need to grade increasing amounts of student work. Unit testing can contribute to the automation of the grading process but it cannot assess the structure or partial correctness of code, which is needed for finely differentiated grading. This paper builds on previous research that investigated machine learning models for determining the correctness of programs from token-based features of source code and found that some such models can be successful in classifying source code with respect to whether it passes unit tests. This paper makes two further contributions. First, these results are scrutinized under conditions of varying similarity between code instances used for model training and testing, for a better understanding of how well the models generalize. It was found that the models do not generalize outside of groups of code instances performing very similar tasks (corresponding to similar coding assignments). Second, selected binary classification models are used as a base for multi-class prediction with two different methods. Both of these exhibit prediction success well above the random baseline, with potential to contribute to automated assessment with multi-valued measures of quality (grading schemes), in contrast to the binary pass/fail measure associated with unit testing
Pacloud: Towards a Universal Cloud-based Linux Package Manager
Package managers are a very important part of Linux distributions but we have
noticed two weaknesses in them: They use pre-built packages that are not
optimised for specific hardware and often they are too heavy for a specific
need, or packagesmay require plenty of time and resources to be compiled. In
this paper, we present a novel Linux package manager which uses cloud computing
features to compile and distribute Linux packages without impacting the end
user's performance. We also show how Portage, Gentoo's package manager can be
optimised for customisation and performance, along with the cloud computing
features to compile Linux packages more efficiently. All of this resulting in a
new cloud-based Linux package manager that is built for better computing
performance.Comment: Presented at CECCC 2019. 8 pages, 13 figure
Clustering Weblogs on the Basis of a Topic Detection Method
In recent years we have seen a vast increase in the volume of information published on weblog sites and also the creation of new web technologies where people discuss actual events. The need for automatic tools to organize this massive amount of information is clear, but the particular characteristics of weblogs such as shortness and overlapping vocabulary make this task difficult. In this work, we present a novel methodology to cluster weblog posts according to the topics discussed therein. This methodology is based on a generative probabilistic model in conjunction with a Self-Term Expansion methodology. We present our results which demonstrate a considerable improvement over the baseline
Prototype/topic based Clustering Method for Weblogs
[EN] In the last 10 years, the information generated on weblog sites has increased exponentially, resulting in a clear need for intelligent approaches to analyse and organise this massive amount of information. In this work, we present a methodology to cluster weblog posts according to the topics discussed therein, which we derive by text analysis. We have called the methodology Prototype/Topic Based Clustering, an approach which is based on a generative probabilistic model in conjunction with a Self-Term Expansion methodology. The usage of the Self-Term Expansion methodology is to improve the representation of the data and the generative probabilistic model is employed to identify relevant topics discussed in the weblogs. We have modified the generative probabilistic model in order to exploit predefined initialisations of the model and have performed our experiments in narrow and wide domain subsets. The results of our approach have demonstrated a considerable improvement over the pre-defined baseline and alternative state of the art approaches, achieving an improvement of up to 20% in many cases. The experiments were performed on both narrow and wide domain datasets, with the latter showing better improvement. However in both cases, our results outperformed the baseline and state of the art algorithms.The work of the third author was carried out in the framework of the WIQ-EI IRSES project (Grant No. 269180) within the FP7 Marie Curie, the DIANA APPLICATIONS Finding Hidden Knowledge in Texts: Applications (TIN2012-38603-C02-01) project and the VLC/CAMPUS Microcluster on Multimodal Interaction in Intelligent Systems.Perez-Tellez, F.; Cardiff, J.; Rosso, P.; Pinto Avendaño, DE. (2016). Prototype/topic based Clustering Method for Weblogs. Intelligent Data Analysis. 20(1):47-65. https://doi.org/10.3233/IDA-150793S476520
Weblog and short text feature extraction and impact on categorisation
The characterisation and categorisation of weblogs and other short texts has become an important research theme in the
areas of topic/trend detection, and pattern recognition, amongst others. The value of analysing and characterising short text is to
understand and identify the features that can identify and distinguish them, thereby improving input to the classification process.
In this research work, we analyse a large number of text features and establish which combinations are useful to discriminate
between the different genres of short text. Having identified the most promising features, we then confirm our findings by
performing the categorisation task using three approaches: the Gaussian and SVM classifiers and the K-means clustering algorithm.
Several hundred combinations of features were analysed in order to identify the best combinations and the results confirmed the
observations made. The novel aspect of our work is the detection of the best combination of individual metrics which are identified
as potential features to be used for the categorisation process.The research work of the third author is partially funded by the WIQ-EI (IRSES grant n. 269180) and DIANA APPLICATIONS (TIN2012-38603-C02-01), and done in the framework of the VLC/Campus Microcluster on Multimodal Interaction in Intelligent Systems.Perez Tellez, F.; Cardiff, J.; Rosso, P.; Pinto Avendaño, DE. (2014). Weblog and short text feature extraction and impact on categorisation. Journal of Intelligent and Fuzzy Systems. 27(5):2529-2544. https://doi.org/10.3233/IFS-141227S2529254427
Selenium and impaired physical function in US and Spanish older adults
Background: Selenium (Se) is a trace element with a narrow safety margin.
Objectives: To evaluate the cross-sectional and longitudinal dose-response association between Se exposure and
measures of impaired physical function and disability in older adults.
Design: NHANES 2011–2014 cross-sectional (US, n = 1733, age ≥60 years) and Seniors-ENRICA-2 2017–2019
cross-sectional and longitudinal (Spain, n = 2548 and 1741, respectively, age ≥65 years) data were analyzed.
Whole blood and serum Se levels were measured using inductively coupled plasma-mass spectrometry. Lowerextremity
performance was assessed with the Short Physical Performance Battery, and muscle weakness with
a dynamometer. Incident mobility and agility limitations, and disability in instrumental activities of daily living
(IADL) were ascertained with standardized questionnaires. Analyses were adjusted for relevant confounders,
including physical activity. Results across studies were pooled using random-effects meta-analysis.
Results: Meta-analyzed odds ratios (95% confidence interval) per log2 increase in whole blood Se were 0.54
(0.32; 0.76) for weakness, 0.59 (0.34; 0.83) for impaired lower-extremity performance, 0.48 (0.31; 0.68) for
mobility limitations, 0.71 (0.45; 0.97) for agility limitations, and 0.34 (0.12; 0.56) for disability in at least one
IADL. Analyses for serum Se in NHANES showed similar results. Findings suggest the inverse association with
grip strength is progressive below 140 μg/L (p-value for non-linear trend in the Seniors-ENRICA-2 study = 0.13),
and above 140 μg/L (p-value for non-linear trend in NHANES = 0.11). In the Seniors-ENRICA-2 cohort, with a
2.2 year follow-up period, a doubling in baseline Se levels were associated with a lower incidence of weakness
[odds ratio (95% confidence interval): 0.45 (0.22; 0.91)], impaired lower-extremity performance [0.63 (0.32;
1.23)], mobility [0.43 (0.21; 0.91)] and agility [0.38 (0.18; 0.78)] limitations.
Discussion: In US and Spanish older adults, Se concentrations were inversely associated with physical function
limitations. Further studies are needed to elucidate underlying mechanisms.Instituto de Salud Carlos III
European Commission
PI18/287
16/609State Secretary of R + D + I
PID2019-108973RB-C21/C22European Social Fund (ESF)
European Commissio
Dietary determinants of cadmium exposure in the Strong Heart Family Study
Urinary cadmium (Cd) concentrations in the Strong Heart Family Study (SHFS) participants are higher than in the general US population. This difference is unlikely to be related to tobacco smoking. We evaluated the association of consumption of processed meats and other dietary products with urinary Cd concentrations in the SHFS, a family-based study conducted in American Indian communities. We included 1725 participants with urine Cd concentrations (standardized to urine creatinine) and food frequency questionnaire data grouped in 24 categories, including processed meat. Median (IQR) urinary Cd concentrations were 0.42 (0.20–0.85) μg/g creatinine. The age, sex, smoking, education, center, body mass index, and total kcal adjusted geometric mean ratio (GMR) (95%CI) of urinary cadmium concentrations per IQR increase in each dietary category was 1.16 (1.04–1.29) for processed meat, 1.10 (1.00–1.21) for fries and chips, 0.87 (0.80–0.95) for dairy products, and 0.89 (0.82–0.97) for fruit juices. The results remained similar after further adjustment for the dietary categories associated with urinary Cd in the previous model except for fries and chips, which was no longer statistically significant. These findings revealed the potential importance of processed meat products as a dietary source of cadmium
Recommended from our members
Healthy lifestyle, metabolomics and incident type 2 diabetes in a population-based cohort from Spain
Background: The contribution of metabolomic factors to the association of healthy lifestyle with type 2 diabetes risk is unknown. We assessed the association of a composite measure of lifestyle with plasma metabolite profiles and incident type 2 diabetes, and whether relevant metabolites can explain the prospective association between healthy lifestyle and incident type 2 diabetes. Methods: A Healthy Lifestyle Score (HLS) (5-point scale including diet, physical activity, smoking status, alcohol consumption and BMI) was estimated in 1016 Hortega Study participants, who had targeted plasma metabolomic determinations at baseline examination in 2001-2003, and were followed-up to 2015 to ascertain incident type 2 diabetes. Results: The HLS was cross-sectionally associated with 32 (out of 49) plasma metabolites (2.5% false discovery rate). In the subset of 830 participants without prevalent type 2 diabetes, the rate ratio (RR) and rate difference (RD) of incident type 2 diabetes (n cases = 51) per one-point increase in HLS was, respectively, 0.69 (95% CI, 0.51, 0.93), and - 8.23 (95% CI, - 16.34, - 0.13)/10,000 person-years. In single-metabolite models, most of the HLS-related metabolites were prospectively associated with incident type 2 diabetes. In probit Bayesian Kernel Machine Regression, these prospective associations were mostly driven by medium HDL particle concentration and phenylpropionate, followed by small LDL particle concentration, which jointly accounted for ~ 50% of the HLS-related decrease in incident type 2 diabetes. Conclusions: The HLS showed a strong inverse association with incident type 2 diabetes, which was largely explained by plasma metabolites measured years before the clinical diagnosis.This work was supported by the Strategic Action for Research in Health sciences [PI10/0082, PI13/01848, PI14/00874, PI16/01402, PI11/00726, PI16/609, PI16/1512, PI18/287, PI19/319 and PI20/00896], the GUTMOM Project (JPI-A Healthy Diet for a Healthy Life INTIMIC-085, State Secretary of R + D + I PCIN-2017-117), the Cátedra de Epidemiología y Control del Riesgo Cardiovascular at UAM (#820024), the State Agency for Research (PID2019-108973RB-C21 and C22), the Valencia Government (GRUPOS 03/101; PROMETEO/2009/029 and ACOMP/2013/039), the Castilla-Leon Government (GRS/279/A/08) and European Network of Excellence Ingenious Hypercare (EPSS- 037093) from the European Commission; CIBER Fisiopatología Obesidad y Nutrición (CIBEROBN) (CIBER-02-08-2009, CB06/03 and CB12/03/30016). MSP holds a Ramón y Cajal contract (RYC-2018-025069-I) from the Ministry of Science, Innovation and Universities. MDV holds a “Predoctoral Training in Health Research” contract (FI20/00162) from the Carlos III Health Institute. MGP and ADR received the support of a fellowship from “la Caixa” Foundation (ID 100010434, fellowship codes LCF/BQ/IN18/11660001, and LCF/BQ/DR19/11740016, respectively). PO received the support of a Sara Borrell contract from the Carlos III Health Institute (reference CD16/00255). The Strategic Action for Research in Health Sciences, CIBEROBN are initiatives from Carlos III Health Institute Madrid and co-funded by the European Social Fund “The ESF - investing in your future”. The State Agency for Research and Carlos III Health Institute belong to the Spanish Ministry of Science and Innovation. The funding bodies had no role in the study design, data collection and analysis, interpretation of results, manuscript preparation or in the decision to submit this manuscript for publication.S
Ciencias de la Biología y Agronomía
Este volumen I contiene 17 capítulos arbitrados que se ocupan de estos asuntos en Tópicos Selectos de Ciencias de la Biología y Agronomía, elegidos de entre las contribuciones, reunimos algunos investigadores y estudiantes. Se presenta un Estudio Comparativo de los Recursos Hidrológico-Forestales de la Microcuenca de la Laguna de Epatlan, Pue. (1993 a 2014); la Situación Actual de la Mancha de Asfalto en Maíz (Zea mays L.) en los Municipios de Jiquipilas y Ocozocoautla, Chiapas, México; las poblaciones sobresalientes de maíz de la raza Zapalote Chico, en la Región Istmeña de Oaxaca; Se indica el índice de área foliar de cultivo de Chile Poblano mediante dos métodos en condiciones protegidas; Esquivel, Urzúa y Ramírez exploran el efecto de la biofertilización con Azospirillum en el crecimiento y producción de Jitomate; esbozan su artículo sobre la determinación del nivel de Heterosis en híbridos de Maíz para la Comarca Lagunera; una investigación sobre la estabilización de semilla de Solanum lycopersicum durante el almacenamiento y estimulación de la germinación; acotan sobre el CTAB como una nueva opción para la detección de Huanglongbing en cítricos, plantean su evaluación sobre el aluminio y cómo afecta la vida de florero de Heliconia psittacorum; indican sobre el impacto del H-564C, como un híbrido de maíz con alta calidad de proteina para el trópico húmedo de México; presetan su investigación sobre la producción de Piña Cayena Lisa y MD2 (Ananas comosus L.) en condiciones de Loma Bonita, en Oaxaca; acotan sobre el efecto de coberteras como control biológico por conservación contra áfidos en Nogal Pecanero; esbozan sobre la caracterización de cuatro genotipos de Frijol Negro en Martínez de la Torre, Veracruz, México; presentan una caracterización hidroecológica de la microcuenca de Arroyo Prieto, Yuriría, Gto., y alternativas para su restauración ambiental; presentan su investigación sobre el efecto del hongo Beauveria bassiana sobre solubilización de fosfatos y la disponibilidad de fósforo en el suelo; plantean su investigación sobre la Germinación y regeneración in vitro de Epidendrum falcatum LINDL; esbozan su artículo sobre genotipos de frijol negro y su tolerancia a sequía terminal en Veracruz, México
Severe Asthma Standard-of-Care Background Medication Reduction With Benralizumab: ANDHI in Practice Substudy
Background: The phase IIIb, randomized, parallel-group, placebo-controlled ANDHI double-blind (DB) study extended understanding of the efficacy of benralizumab for patients with severe eosinophilic asthma. Patients from ANDHI DB could join the 56-week ANDHI in Practice (IP) single-arm, open-label extension substudy. Objective: Assess potential for standard-of-care background medication reductions while maintaining asthma control with benralizumab. Methods: Following ANDHI DB completion, eligible adults were enrolled in ANDHI IP. After an 8-week run-in with benralizumab, there were 5 visits to potentially reduce background asthma medications for patients achieving and maintaining protocol-defined asthma control with benralizumab. Main outcome measures for non-oral corticosteroid (OCS)-dependent patients were the proportions with at least 1 background medication reduction (ie, lower inhaled corticosteroid dose, background medication discontinuation) and the number of adapted Global Initiative for Asthma (GINA) step reductions at end of treatment (EOT). Main outcomes for OCS-dependent patients were reductions in daily OCS dosage and proportion achieving OCS dosage of 5 mg or lower at EOT. Results: For non-OCS-dependent patients, 53.3% (n = 208 of 390) achieved at least 1 background medication reduction, increasing to 72.6% (n = 130 of 179) for patients who maintained protocol-defined asthma control at EOT. A total of 41.9% (n = 163 of 389) achieved at least 1 adapted GINA step reduction, increasing to 61.8% (n = 110 of 178) for patients with protocol-defined EOT asthma control. At ANDHI IP baseline, OCS dosages were 5 mg or lower for 40.4% (n = 40 of 99) of OCS-dependent patients. Of OCS-dependent patients, 50.5% (n = 50 of 99) eliminated OCS and 74.7% (n = 74 of 99) achieved dosages of 5 mg or lower at EOT. Conclusions: These findings demonstrate benralizumab's ability to improve asthma control, thereby allowing background medication reduction
- …